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Abstract 

The performance of algebraic soft-decision decoding of Reed-Solomon codes using bit-level soft information is 
investigated. Optimal multiplicity assignment strategies of algebraic soft-decision decoding with infinite cost are first 
studied over erasure channels and the binary symmetric channel. The corresponding decoding radii are calculated 
in closed forms and tight bounds on the error probability are derived. The multiplicity assignment strategy and 
the corresponding performance analysis are then generalized to characterize the decoding region of algebraic soft- 
decision decoding over a mixed error and bit-level erasure channel. The bit-level decoding region of the proposed 
multiplicity assignment strategy is shown to be significantly larger than that of conventional Berlekamp-Massey 
decoding. As an application, a bit-level generalized minimum distance decoding algorithm is proposed. The proposed 
decoding compares favorably with many other Reed-Solomon soft-decision decoding algorithms over various channels. 
Moreover, owing to the simplicity of the proposed bit-level generalized minimum distance decoding, its performance 
can be tightly bounded using order statistics. 

Index Terms 

Algebraic Soft-decision Decoding, Bit-level Soft Information, Error and Erasure Decoding, Generalized Minimum 
Distance Decoding, Guruswami-Sudan Decoding, Koetter-Vardy Decoding, List Decoding, Multiplicity Assignment, 
Reed-Solomon codes. 

I. Introduction 

Reed-Solomon (RS) codes are powerful error correction codes which are widely employed in many state-of-the- 
art communication systems. However, in most existing systems, RS codes are decoded via algebraic hard decision 
decoding (HDD) which does not fully exploit the error correction capability of the code. Efficiently utilizing the 
soft information available at the decoder input to improve the performance of RS codes is a long-time standing open 
problem. However, this can be computationally complex and in fact, it has recently been proved that maximum 
likelihood decoding (MLD) of RS codes is an NP hard problem [1]. Nevertheless, many efficient suboptimal soft 
decision decoding (SDD) algorithms have been developed. Some use the reliability value to assist HDD, [2]-[4]; 
some take advantage of the structure of RS codes at the bit-level to reduce the complexity [5], [6]; some apply 
decoding algorithms for general linear block codes to RS soft-decision decoding [7], [8]; and more recently, iterative 
techniques have also been proposed for RS soft-decision decoding [9]-[12]. 
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{jingj@qualcomm.com) was with Texas A&M University and Is now with Qualcomm Inc. Krishna R. Nai'ayanan (krn&ece.tamu.edu) is 
with Texas A&M University. 
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Since the seminal works by Sudan [13], Guruswami and Sudan [14] and Koetter and Vardy [15], algebraic soft- 
decision decoding (ASD) algorithms have gained research interest due to their significant performance improvement 
over HDD. From a practical perspective, many techniques have been proposed to steer ASD decoding towards 
an implementable alternative to conventional Berlekamp and Massey (BM) decoding, for example, [16]-[21] and 
references therein. We also refer interested readers to [22] for a comprehensive tutorial of the generic GS algorithm. 
From a theoretical perspective, however, optimal multiplicity assignment strategy (MAS) for ASD and corresponding 
performance analysis still remains an open problem. In [15], Koetter and Vardy presented an asymptotically optimal 
MAS that maximizes the transmission rate for a given channel such that the probability of error can be made 
arbitrarily small as the code length goes to infinity. Multiplicity assignment optimization for finite length RS 
codes has been considered in [23]--[25] using numerical algorithms. In [25], MAS for general discrete memoryless 
channels (DMC) has been investigated and an upper bound based on a Chernoff bounding technique has been 
derived. However, the Chernoff-type bound [25] largely relies on numerical computation and gives little insight into 
the decoding region of ASD under a certain MAS. Besides, the bound becomes loose for practical high rate RS 
codes. In [26], a general framework has been studied to optimize the multiplicity assignment for an additive cost 
function (e.g., Hamming distance, weighted Hamming distance and other error metrics). In fact, as one reviewer 
pointed out, the optimal decoding radius of ASD over the EEC and the BSC derived in Section III and Section 
IV can also be solved using this general framework proposed in [26]. However, the multiplicity optimization in 
[26] does not apply to practical noisy channels (such as additive white Gaussian noise (AWGN) channels) in an 
obvious way. More recently, the decoding region and typical error patterns of ASD with infinite cost over some 
basic DMC's have been studied independently in [27] and [28] (which can be viewed as more detailed discussions 
on some special cases covered by the general framework in [26]). Based on the analysis in [27], the performance 
of practical high rate RS codes can be tightly bounded over erasure channels. However, as suggested in [28], even 
with infinite cost, ASD has a significant gain over BM only for RS codes of low rates or when the number of 
candidates at the decoder input is small. 

In this paper, we propose a MAS for ASD that provides a significant performance improvement over the BM 
algorithm even for high rate RS codes with a computational complexity that is practically affordable. In contrast to 
the popular view point that ASD is a symbol-level SDD scheme, we study the performance of ASD of RS codes 
using bit-level soft information. We show that carefully incorporating bit-level soft information in multiplicity 
assignment and interpolation is the key step to achieve most of the coding gain offered by ASD but still maintain 
a moderate complexity. Based on the analysis, a new SDD scheme is proposed for RS codes, which outperforms 
many existing ASD algorithms in the literature in terms of both performance and computational complexity. 

The rest of this paper is organized as follows: After a brief review of ASD in Section II, we investigate optimal 
MAS's for ASD over erasure channels and binary synmietric channels (BSC) with infinite cost in Sections III and 
IV. The corresponding decoding region of ASD is characterized and performance bounds are derived. It is shown 
that ASD has a significant gain over conventional BM decoding over binary erasure channels (BEC) and most of 
the coding gain comes from appropriate multiplicity assignment to bit-level erasures. On the other hand, the gain of 
ASD over GS decoding is large only for short length or low rate RS codes over BSC's. In Section V, the analysis 
is generalized to a mixed error and bit-level erasure channel and a simple MAS is proposed. In the infinite cost 
case, the decoding region of the proposed MAS is shown to approach the outer bound of optimal MAS for practical 
medium to high rate RS codes. In the finite cost case, the decoding region of the proposed MAS is also characterized 
for any given multiplicity parameter M. By treating erasures at the bit-level, the proposed MAS has a significantly 
larger decoding region than that of conventional BM and more recent GS algorithm. Based on insights obtained 
from the performance analysis, in Section VI, we develop a sequential MAS called bit-level generalized minimum 
distance (BGMD) decoding, which successively erases the least reliable bits (LRB). In spite of its simplicity, BGMD 
algorithm provides a significant gain over conventional BM decoding and compares favorably with many existing 
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MAS's of ASD and other RS SDD schemes over various channels of practical interests. Moreover, due to its simple 
structure, the decoding performance of BGMD for practical high rate RS codes can be tightly bounded using a 
standard order statistics bounding technique. BGMD upper bound suggests a significant gain over BM decoding in 
the high SNR region, where the evaluation of the performance is beyond the capability of computer simulation but of 
significant practical value. Simulation results are presented in Section VII and conclusion is drawn in Section VIII. 

II. Reed-Solomon Codes and Algebraic Soft Decision Decoding 

In this section, we review some background materials on RS codes and ASD of RS codes that are relevant to 
this paper. Underlined letters will be used to denote vectors and bold face letters will be used to denote matrices 
throughout this paper. 

A. Evaluation Form of Reed-Solomon Codes 
Define the message vector g as: 

9 = {go,gi,---9K-i),9i€GF{q). (1) 

The polynomial form of the message is: 

g{x) =go + 9ix-\ h gK-ix^~^ (2) 

Let r = {71, 72, • • • , 7Ar} be an ordered set of N distinct elements in GF{q). An {N, K) RS codeword is defined 
by evaluating the message polynomial g(x) at the N distinct points in the ordered set T. The corresponding RS 
code is the set of all possible such evaluations: 

C(7V,jr,{7i,72,--- ,7Jv}) = {(5(7i),9(72),-- - ,9{7n))} (3) 
where g{x) e GFq[x] is a polynomial with maximum degree less than K. 

B. Algebraic Soft-Decision Decoding 

Let A{X, Y) = X]S:0 X^jlo O'ij^^Y'' be a bivariate polynomial over GF{q) and let Wx, Wy be nonnegative real 
numbers. The (w^;, w^)- weighted degree of a nonzero polynomial A(X,Y) (denoted as deg^^ ,j,^(A)) is defined 
as the maximum over all numbers iwx + jwy such that a^^ ^ 0. The (1, 1) degree is usually referred to as the 
degree of the polynomial A{X, Y) (denoted as dQq{A)). The bivariate polynomial A{X, Y) is said to pass through 
a point (a, /3) with multiplicity m (or equivalently, the point (a, /3) is said to be a zero of multiplicity m of the 
polynomial A{X, Y)), if A{X + a, F + /3) contains a monomial of degree m and does not contain any monomials 
of degree less than m. 

Suppose an RS codeword is modulated and transmitted through a memoryless channel. Let X_ = {Xi , A2 , • • • , X^) 
be the random variable of the transmitted codeword and x= {xi,X2,- " > xn) be the realization of the random 
variable. Let 3^ = (3^i, 3^2, • • • , J^jv) be the random variable of the channel output and y = (2/1, 1/2, • " " , Vn) a 
particular realization of the channel output. The a posteriori probability (APP) of each symbol without considering 
the constraint imposed by the RS code can be computed from the channel output as: 

= Pr {Xj = ai\yj = yj),l<i<qA<j <N (4) 

where {ai, a2, ■ ■ ■ , aq} are all distinct elements in GF{q). Define the qx N reliability matrix 11 as a matrix with 
entries {T^ij} as computed in (4). 11 serves as a soft input to an ASD decoder. The generic algorithm of ASD as 
in [15] is described in the following 4 steps: 

Multiplicity Assignment: Compute the multiplicity matrix M with integer entries {Mij} based on the reliability 
matrix n according to a particular multiplicity assignment strategy. 
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Interpolation: For a given multiplicity matrix M with entries {Mi j}, construct a bivariate polynomial A{X^ Y) 
of minimal {\,K — l)-weighted degree that passes through each point (7^, aj) with multiplicity at least Mij, for 
i = l,2,--- ,gand j = l,2,--- ,N. 

Factorization : Find all polynomials g{X) such that {Y — g{X)) divides A{X,Y) and deg{g{X)) < K. Form 
a candidate codeword list by re-encoding all such polynomials g{X). 

Codeword Selection : If the candidate codeword list is not empty, select the most Ukely codeword from the list 
as the decoder output; otherwise, declare a decoding failure. 

Intuitively, the idea of ASD is to take advantage of soft information and assign higher multiplicities to more 
probable symbols such that the decoder has a better chance to find the correct codeword. 

C. Performance of Algebraic Soft-decision Decoding 
Define the inner product between two qx N matrices as: 



q N 

(A,B) =^trace(AB^) = ^^aij&ij (5) 

Let 1 be the all-one q x N matrix. Suppose the vector x represents an RS codeword, let [x] be the codeword 
matrix with entries \x\^J defined as: \x\^J = 1 if xj = \x}^j = 0, otherwise. As in [15], the score and cost are 
defined as follows. 

Deflnition 1 The score Sm{x) with respect to a codeword x for a given multiplicity matrix M is defined as: 

Sm{x) = (M, \x]) (6) 

Definition 2 The cost Cm of a given multiplicity matrix M is defined as the number of linear constraints imposed 
by M: 

q N 

= 2 + 1) = (M,M-H)/2 (7) 

It is understood that the score and the cost are associated with a particular multiplicity matrix M. Besides, unless 
otherwise specified, the score is taken with respect to the transmitted codeword. Therefore, for brevity, we use S 
to denote Sm{3L) and C to denote Cm throughout the rest of this paper 

Similar to other list decoding algorithms, the probability of error of ASD can be upper bounded using the union 
bound: 

Pasd < Pust + Pml (8) 

where Pust is the probability that the transmitted codeword is not on the list and Pml is the probability that the 
maximum likelihood decision is not the transmitted codeword. It is known in the literature (see [12] [15] [16]) that 
Phist 3> Pml for many practical high rate long RS codes. Therefore, we approximate Pasd ~ Plisi throughout 
the rest of this paper (see also [25] [28]). However, for low rate codes, Pml must be computed to evaluate Pasd 
(see [29] for details). In general, the decoding region of ASD is difficult to characterize and analytical computation 
of PList is a tough problem. However, it is shown in [15] that the transmitted codeword is guaranteed to be on the 
list when the following sufficient condition is satisfied: 

Lemma 1 [15] Finite cost: A sufficient condition for the transmitted codeword to be in the list is: 

) + l 



5 > min U e Z 



K-l 



K-l 



>C\ (9) 
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The proof of Lemma 1 is given in Theorem 3 in [15]. The above sufficient condition can also be expressed as (see 
also [22]): the transmitted codeword will be in the list if 



T{S) > C (10) 
a 
2' 



where r(S') = (a + 1) S - ^{K - 1) ,a{K - 1) < S < {a + 1){K - 1), a = 0,1, 



is a piecewise linear function. 

Generally speaking, larger cost leads to better decoding performance, while it also increases complexity (though 
the performance of ASD does not monotonically improve as the cost increases). As the cost goes to infinity, we 
can further simplify the sufficient condition as: 

Lemma 2 [15] Infinite cost: A sufficient condition for the transmitted codeword to be in the list as C ^ 00 is: 



S> ^J2{K-1)C (11) 

See Corollary 5 in [15] for the proof. 

Usually, the sufficient conditions (9) and (11) become tight approximations when N is large (see [15], [30]). 
With a little bit abuse of terminology, we have the following definition: 

Definition 3 For ASD with finite cost, a received word is said to be certainly decodable if and only if the sufficient 
condition (9) is satisfied. 

Definition 4 For ASD with infinite cost, a received word is said to be certainly decodable if and only if the sufficient 
condition (11) is satisfied. 

For the rest of the paper, we approximate the actual decoding error probability of ASD by the probability that a 
codeword is NOT certainly decodable. An ASD error is declared when the received word is not certainly decodable 
as defined in Definition 3 and Definition 4 for finite cost and infinite cost cases respectively. Practically speaking, 
though the decoder may stiU be able to list the transmitted codeword even when the sufficient condition is not 
satisfied, the probability is very small and the approximation is tight for long codes, which are used in many 
existing standards (see [15], [16] and [30]). 

D. Binary Image Expansion of Reed-Solomon Codes 

RS codes can also be viewed as non-binary Bose-Chaudhuri and Hocquenghem (BCH) codes. The parity-check 
matrix of a narrow sense 'RS{N, K) code over GF(g) with a minimum distance dmin = N — K + 1 can be 
represented by: 



H, 



/I a ■■■ a(^-i) \ 
1 ••• a2(iv-i) 



(12) 



where a is a primitive element in GF{q). 

RS codes over GF(2'") (which are of most practical interest) can be represented using equivalent binary image 
expansions [31]. Let n = N x m and k = K x mhe the length of the codeword and the message at the bit-level, 
respectively. Hg has an equivalent binary image expansion H{, (see [31] for details), where H;, is an {n — k) x n 
binary parity check matrix. In other words, an RS code can also be viewed as a binary linear block code. In 
many practical systems, RS codewords are mapped into their binary image expansions and then transmitted through 
channels using binary modulation formats. Therefore, it is of both theoretical and practical value to study ASD 
using bit-level soft information. 
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III. Performance Analysis of Algebraic Soft-decision Decoding over Erasure Channels 
In this section, we consider MAS's for erasure channels and their corresponding performance analyses. 

A. Algebraic Soft-decision Decoding over the Binary Erasure Channel 

We first consider the case when RS codewords are transmitted as bits through a BEC with erasure probability e. 
Note that we are interested in both the truly optimal multiplicity assignment, (namely, for a given cost, the score 
is maximized) and the asymptotically optimal multiplicity assignment that maximizes the number of correctable 
bit-level erasure in the worst case (which can also be solved from the general formula in [26]). However, it should 
be noted that the MAS we are considering here does not take into account the inherent structure of RS code, 
which is extremely challenging [15]. As a result, similar to the "adversarial channel optimal"argument in [15], [25], 
[24], we assume that the symbols in a codeword are independent and identically distributed (i.i.d) with a uniform 
distribution over GF{q) during the multiplicity assignment stage. Along the way of these derivations, we also gain 
some insight on the dominating erasure patterns over the BEC. 

We first define the "candidate" of a symbol and the "type" of a symbol as follows: 

Definition 5 A binary m-tuple ai € {0, l}™ is a candidate for a received m-tuple symbol yj G {0, 1, e}™ if each 
non-erased bit of yj agrees with the corresponding bit of ai. 

Definition 6 The type of a received m-tuple symbol yj G {0, 1, e}™ over the BEC is the number of bit-level 
erasures in yj. 

For BEC's, according to the "optimal for the adversarial channel model" argument, a natural MAS is to assign 
equal multiplicity to each candidate in the same symbol. Consequently, there are 2' candidates for a symbol of type 
i. For a codeword over GF{2'^), there are at most {m-\- 1) types of symbols. Let be the number of symbols of 
type z in a received word. As discussed above, we will assign the same multiplicity to each candidate in the same 
symbol. Moreover, we assume the same multiplicities are assigned to all candidates of the same type in a received 
word; whereas, the multiplicity assigned to type i may vary according to the received word. This assumption will be 
justified later. Let be the multiplicity assigned to each candidate of type i. Thus, the total multiplicity assigned 
to one symbol of type i is 2'mi. The score (with respect to the transmitted codeword) and cost are: 



m 

S = ^ajmi (13) 
C = t'^M'^'^') (14) 

^ m 

C = - ^ai2'm2(l + o(l)), mi ^oo (15) 
<=o 

The approximation in (15) becomes tight when rrii becomes large. We will derive an upper bound and a lower 
bound on the probability of ASD error with infinite cost as defined in Definition 4. Furthermore, we consider ASD 
with infinite cost such that we can relax the multiplicities from being integers to real numbers. It is justified by 
the fact that rational numbers are dense on the real axis and every finite collection of rational numbers can be 
scaled up to integers if there is no cost constraint. Hence any real number multipUcity assignment scheme can be 
approximated arbitrarily close with integer number multiplicity assignment scheme (see also [24]). 

Following [15] [16], we define proportional multiplicity assignment strategy (PMAS) as follows: 
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Proportional Multiplicity Assignment Strategy: For a given total multiplicity per symbol M, PMAS assigns 
multiplicity proportional to the APP of that candidate, i.e., the multiplicity we assign to ak in the symbol 
of the received vector is Mk,j = \_-Kk,jM\, where M is a predetermined real number. 

For example, in the case of BEC, Mj-^j = \_'Kk,iM\ = [2~'MJ if ak is a candidate for the j^^ symbol, which is 
a type i symbol; Mkj = if cannot be a candidate for the j"* symbol. PMAS defined above is equivalent to 
the simplified KV defined in [16]. Note that there is a quantization error, however, the multiplicity assignment is 
asymptotically proportional to the APP as the cost approaches infinity. We will show in the following that PMAS 
is optimal over the BEC with infinite cost: 

Theorem 1 The proportional multiplicity assignment strategy (PMAS) is optimal over the BEC regardless of the 
received signal, i.e., PMAS maximizes the score for a given cost over the BEC. 

Proof: Assume that the received word has symbols of type i, the MAS can be formulated as maximizing 
the score with a cost constraint. With infinite cost, the problem is expressed as: 



max S = a 

{m,} ^ 
1=0 



m, (16) 



subject to C « - ^ ait ml < Co 



2^=0 

This is a standard optimization problem with Unear cost function and quadratic constraint. Using a Lagrange 
multipUer, the new objective function becomes 

m ( \ ^ \ 

L=-^aimi + A -^2Vimf-Co (17) 

j=0 V i=0 / 

Take the partial derivative with respect to mi and set it to zero. We have: 

= -tti + Xl'aimi = (18) 



dmi 

Therefore we have = i.e., cx 2^% which proves that PMAS is optimal. ■ 
Note that mj does not depend on a^. Even without the assumption that equal multiplicities are assigned to candidates 
of the same type, we still get = ^ for all type i candidates, i.e., PMAS is optimal over the BEC. 

Since PMAS is optimal over the BEC, we will from now on assume that PMAS is used. Under PMAS, we 
assume that the total multiplicity for each symbol is M. Consequently, the score is So = X]™ q ai2~^M = rjM 
and the cost is Co = 5 Yh^o ai2~'M^(l + o(1)) = ^r]M'^{l+o{l)), where 77 = Yh^o '^i^"* is a positive number. 
The sufficient condition of (11) becomes: 



So > V^K - l)Co (19) 
rj>K-l (20) 

When K = 1, under PMAS, > 0, the transmitted codeword will always be on the decoding list. From now on, 
we only consider the case K > 1. 

We study the worst case bit-level decoding radius of ASD under PMAS with infinite cost over the BEC. We 
need the following lenmias. 

Lemma 3 Over the BEC, if a received word y is certainly decodable under PMAS with infinite cost, then any 
other word y', in which only a subset of the erased bits in y are erased, is always certainly decodable. 



Proof: The proof is iimnediate by the fact that if some of the erasures are recovered, r] will increase and as 
can be seen from (20), the decoding performance monotonically improves as rj increases. ■ 
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Lemma 4 Over the BEC, given f bit erasures, the worst case erasure pattern for ASD under PMAS with infinite 
cost is that all erased bits are spread in different symbols as evenly as possible. That is: {N — f + L;^J-^) symbols 
contain [-^J bit erasures and (/ — [-^JiV) contain f-^] bit erasures. 



Proof: Take two arbitrary symbols of type i and j, if we average the bit erasures between these two, we get 
two symbols of type [^J and • The updated 77' can be expressed as: 

77' = 77 + 2- L4^J + 2- r^i^l - 2"* - 2-J' < 77 (21) 

Since rj > -q' and again according to (20), the latter erasure pattern is worse. By repeating the above procedure, 
we can finally get the worst case erasure pattern of ASD under PMAS, i.e., the bit erasures are spread as evenly 
as possible in different symbols. ■ 
According to Lenrnia 3 and Lemma 4, the bit-level decoding radius of PMAS can be characterized. 

Theorem 2 Over the BEC, every pattern of f bit erasures can be corrected with ASD under PMAS with infinite 
cost if the following is satisfied: 

f < {i + 1)N -T{K- 1), for 2-^ + ^ ~ ^'^'^ <R< 2-^'-''> + = 1, 2, • • • , m (22) 

Especially, for high rate RS codes, we have: 

f < 2N-2iK-l),forR>^ + ^ (23) 

Proof: According to Lemma 4, the worst case erasure pattern is aU erased bits spread evenly over different 
symbols. First consider the case f < N, (20) becomes: 

{N -f) + lf>K-l (24) 

The corresponding rate region must satisfy the constraint that when / = TV + 1, in the worst case rj < K — 1. We 
get K > + ^ in this case. Altogether, we get the bit-level decoding radius for the high rate case: 

/<2iV-2(i^-l), wheni?> i + A (25) 

Similarly, when the decoding radius {i - 1)N < f < iN, we must have 2"^ + 1 - 2-(*+i) <K< 2-('-i)Ar -|- 
1 — 2~', where i = 1, 2, • • • , m. We can obtain the exact decoding radius for all these cases: 

f <{i + l)N- 2\K - 1), when 2-'N + 1 - 2''^'+'^^ <K < 2-^'-'^'^N + 1-2-' (26) 

■ 

According to Theorem 2, any erasure pattern with / < 2 (TV — K + 1) is certainly decodable. We can get an upper 
bound on the frame error rate (FER) of ASD under PMAS with infinite cost. 

Corollary 1 For RS codes of rate ^ > 5 + 377 over the BEC, a received vector is NOT certainly decodable under 
PMAS with infinite cost when there are more than 2{N — K) + 1 symbols containing erased bits. 

Proof: The coroUary follows from (23) and Lemma 3. If there are more than 2(A'^ — i^) -|- 1 symbols having 
erased bits, the most optimistic case is that these symbols are of type 1. Besides, due to (23), the sufficient condition 
is not satisfied as defined in Definition 4. ■ 
Theorem 2 gives an upper bound on the FER performance over the BEC and Corollary 1 provides a lower 
bound. These bounds are shown in Figure 1 in conjunction with the union bound on the averaged FER of a 
maximum likelihood (ML) decoder over the RS ensemble [32]. Note that ASD has a significant performance gain 
over conventional BM erasure decoding. This is intuitive since we do not have to erase the whole symbol if some 
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Fig. 1. Bounds and Simulation Results of ASD for RS(255,239) over the BEC 



bits in the symbol are erased, which can be taken advantage of by ASD. It can be seen from the figure that for 
practical high rate long codes, both the upper and lower bounds are tight and they together accurately indicate the 
performance of ASD. Also note that the averaged performance of the ML decoder over the RS ensemble is very 
close to the capacity of the BEC, which shows that RS codes are good codes. 

The tightness of the upper bound and the lower bound of ASD motivates the following proposed MAS: 
Proposed Multiplicity Assignment Strategy: In each received symbol, we assign mo = M if that symbol does 
not contain erased bits, assign mi = M/2 if the symbol contains 1 bit-level erasure and assign zero multiplicity 
for symbols containing more than 1 bit-level erasures, that is to set rrij = 0,j = 2, - ■ ■ , m. 

Remark 1 Since erasing 2 bits in the same symbol leads to the same score but less cost than 2 bits in two different 
symbols,the worst case erasure pattern of the proposed MAS for RS codes with R> ^ + is that all bit-level 
erasures are spread in different symbols. According to Theorem 2, the proposed MAS can recover any bit-level 
erasures containing less than 2{N — K + 1) bit erasures. Essentially, the proposed MAS takes care of the worst 
case erasure pattern only and it is asymptotically optimal in terms of achieving the largest worst case decoding 
radius. Consequently, the PER upper bound derived in Theorem 2 for R> ^ + 4^ is also a valid upper bound 
for the proposed MAS. Though the proposed MAS is not optimal as PMAS, the loss is quite small by comparing 
the upper bound of the proposed MAS and the lower hound of PMAS for high rate RS codes. It can also be seen 
from Figure 1 that the simulation results of the proposed MAS and optimal PMAS are very close. 

B. Extension to T^-ary Erasure Channels 

We extend the result in Subsection III-A to 2"-ary erasure channels, i.e. u coded bits are grouped together and 
transmitted using a symbol (it can be QAM or PSK modulation format). The channel will erase the signal 

with erasure probability e at 2''-ary symbol-level. Practical channels of this model were discussed in [33]. 

In Subsection 111-A, we showed that PMAS is optimal for erasure channels. Clearly, all erasure patterns in this 
2"-ary erasure channel model is a subset of erasure patterns of BEC. Therefore, with infinite cost, PMAS is still 
optimal for this channel model. Here, we only consider the case when u divides m, i.e., m = vu. Thus, for each 
symbol, we have {v -\- 1) types. 
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Lemma 5 Over the 2'^-ary erasure channel, the worst case erasure pattern for ASD under PMAS with infinite cost 
is that all erasure events are spread in different symbols as evenly as possible. 

Proof: Assume two RS symbols are of type i and j, we can average the erasure events between the two 
symbols, we have: 

7]' =r] + + 2-^"*^!" - 2"™ - 2"^" < r? (27) 

Similar to Lemma 4, spreading erasure events in different RS symbols evenly gives the worst case. ■ 

Theorem 3 Over the 2^-ary erasure channel, ASD under PMAS can guarantee to decode up to f < {N — K + 
1)/(1 - 2-") 2"-ary symbol-level erasures if R> 2"" + i+2-"^-2-" 

Proof: According to Lemma 5 and (20), spreading erasure events in different symbols is the worst case erasure 
pattern M K > 2-"iV + 1 + 2-2" - 2"", that is i? > 2"" + 1+2-^^-2-" _ -^hus rj = N - {1 - 2"")/. According 
to (20) when the following condition is satisfied: 

/ < (iV - + 1)/(1 - 2-") (28) 

the transmitted codeword is guaranteed to be on the list. ■ 

Remark 2 Note that (28) is a generalization of Theorem 2 (with u = 1 as a special case). As u becomes larger, 
the asymptotical 2"-ary symbol-level decoding radius gets smaller As m, f gets close to the conventional BM 
erasure decoding region. 

IV. Performance Analysis of ASD over the BSC 

In this section, we study the performance of ASD over BSC's. For BSC's, both the transmitted and the received 
symbols are in GF{q), i.e., xj G GF{q) and j/j <E GF{q), for j = 1, - ■ ■ .N. In this case, bit-level reliabilities are 
the same for all received bits. However, symbol-level soft information can be utilized by ASD under proper MAS. 
In the BSC case, the "type" of a symbol is related to its bit-level Hamming distance between xj and tjj. 

Definition 7 The type of a received m-tuple symbol yj e {0, 1}"* over the BSC is defined as the bit-level Hamming 
distance between xj and yj. 

Let flj denote the number of symbols of type i. Again, the same multiplicity is assigned to candidates of the same 
type, that is, we set Mkj = mH(k,j), where H{k,j) is the Hamming distance between symbol and the received 
symbol yj. For example, we assign mo to the received symbol, mi to all the m symbols which differ from the 
received symbol in 1-bit position and so on. Note that M^j is the multipUcity assigned to a/, in the j'*^ symbol and 
rrii is the predetermined multiplicity we wiU assign to candidates of type i. However, unlike the BEC case, the type 
of the transmitted symbol is unknown at the receiver, the truly optimal MAS is not easy to derive as in the BEC 
case. Therefore, we resort to the asymptotically optimal MAS, i.e., maximizing the bit-level decoding radius for the 
worst case error pattern. It can also be easily justified that non-uniform multiplicity assignment among candidates 
of the same type is strictly suboptimal in terms of achieving the largest decoding radius for the worst case error 
pattern, since the worst case error pattern will always correspond to the candidates with the smallest multipUcities. 
The MAS optimization problem can be formulated as a max-min problem over {ui} and {mj}. 
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max mm e 

{rrii} {aj 



= ^icii-l (29) 
s. t. aiiTii < 







fli = A/', where a, are non-negative integers 

where e = 'J^iLo i'^i ^ 1 is the number of bit errors, S = X^tlo is the score, C = YliLa (™) is the cost 
(infinite cost approximation to be more exact) over the BSC and J2^Lo is the total number of symbols of the 
RS codeword. It can be seen that (29) maximizes the bit-level error decoding radius of ASD in the worst case. 

Note that the optimization problem in (29) is subsumed by the general formula in [26], as the bit-level errors 
can also be viewed as an additive cost. In this paper, we take a different approach to derive the optimal decoding 
radius (another similar derivation can be found in [28]). We first take one step back and consider a special case of 
BSC, called 1-bit flipped BSC, i.e., in each symbol, at most 1 bit is in error. By doing that, we only have to assign 
multipUcities to two types of candidates. An optimal MAS for this 1-bit flipped BSC is to assign M to y/s and 
tM to all their 1-bit flipped neighbors. The asymptotically optimal decoding radius Cmax and the corresponding 
optimal t can be computed in close forms. The derivations are given in Appendix I. 

It should be noted that for the 1-bit flipped BSC, the performance improvement of ASD over GS is significant 
only when the rate is low. For instance, for N = 255, K = 223, ASD does not increase the bit-level error correcting 
radius, for N = 255, K = 167, ASD gives an extra error correction capabiUty over GS decoding at the bit-level, 
for N = 255, K = 77, it corrects 7 more errors and for N = 255, K = 30, it corrects 45 more errors. For K < 30, 
all errors can be corrected for the 1-bit flipped BSC. 

Now, we show that the above proposed MAS is also asymptotically optimal for RS codes over the BSC under 
certain conditions, which are satisfied for a wide range of code rates. We begin with the following Lemma. 

Lemma 6 Over the BSC, the worst case error pattern for the proposed MAS with infinite cost is all erroneous bits 
spread in different symbols, if the total number of bit errors e < N and the optimal multiplicity coefficient t <\. 

Proof Assume e< N bits get flipped by the channel. If bit errors are spread in different symbols, as the 1-bit 
flipped BSC case, the score can be expressed as: 



S = M[{N - e) + te] (30) 

The cost of ASD for the BSC does not change when the MAS is fixed. For a given number of bit errors, the worst 
case error pattern minimizes the score of the MAS. In the above MAS, multiplicities are assigned only to y^'s and 
their 1-bit flipped neighbors. Thus, a potentially worse error pattern than the 1-bit flipped BSC is to group bits in 
some 1 -bit-flipped symbols to reduce the score. 

Let the worst case error pattern have e' symbols containing 1-bit error and e" symbols containing more than 
1-bit errors. Evidently, for symbols containing more than 2-bit errors, we can always further decrease the score of 
the proposed MAS for 1-bit flipped BSC by splitting these bit errors into one symbol containing 2-bit errors and 
the other containing the remaining errors. Consequently, the worst case error pattern will contain symbols with at 
most 2-bit errors. We have e' -|- 2e" = e. The score becomes: 



S" = M[{N -e' - e") + te'] = M[{N - e) + et + e"(l - 2t)] 



(31) 
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When t < ^, S" > S, which proves that spreading all erroneous bits in different symbols is the worst case error 
pattern for the proposed MAS over the BSC. ■ 

Theorem 4 In the infinite cost case, the proposed optimal MAS for the 1 -bit flipped BSC is also optimal for the 

BSC if the optimal decoding radius of ASD over the 1 -bit flipped BSC Cmax < N (as given in (69)) and the 
corresponding optimal multiplicity coefficient t < ^. The optimal bit-level decoding radius of ASD over the BSC is 

also Grnax- 

Proof: According to Lemma 6, over the BSC, all erroneous bits spread in different symbols is the worst 
case error pattern for the proposed MAS if e < iV and t < ^, which is nothing but the 1-bit flipped BSC's. On 
the other hand, the proposed MAS derived in Appendix I is asymptotically optimal for the 1-bit flipped BSC, i.e., 
maximizing the worst case decoding radius e^ax- Consequently, the proposed MAS wiU guarantee to decode aU 
error patterns with no more than e„iaa;-bit errors over BSC's as well. ■ 
The error correction radius of optimal MAS as a function of t over an RS(255, 55) code is given in Figure 2. It 
can be seen that optimal MAS (which is achieved by t = 0.2) corrects 13 and 50 more bit-level errors than GS and 
BM in the worst case. Besides, we also plot bit-level radius of PMAS as a function of the crossover probability 
Pb of the BSC. Note that PMAS is not asymptoticaUy optimal for the BSC here. Even though we choose pb to 
maximize the bit-level radius (around pb = 0.13), the bit-level decoding radius is still 1 bit smaller than that of 
optimal MAS. The reason can be explained as follows: the worst case error pattern of this BSC is shown to be 
all bit-level errors spread in different symbols, thus, the proposed asymptotically optimal MAS only has to assign 
multiplicities to candidates of type and type 1. On the other hand, PMAS assigns multiplicities proportionaUy. 
Thus it also assigns multiplicities to candidates of all types and unnecessarily spends more cost, which makes it 
suboptimal in terms of achieving the worst case bit-level decoding radius. 



160 




Fig. 2. Bit-level decoding radius of an RS (255,55) code 

We consider the performance of this asymptotically optimal MAS using a toy example shown in Figure 3. 
Consider the performance of an RS(7, 3) code over the BSC. For this code, the decoding radii of conventional BM 
and GS algorithm are 2 and 3 respectively. Note that, the bit-level decoding radius of PMAS can also be optimized 
over the crossover probability. However, in this case, the optimal bit-level radius of PMAS is stiU 3, while the 
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proposed optimal MAS, on the other hand, can achieve bit-level decoding radius 4. We can see from Figure 3, the 
performance upper bound of ASD under the proposed optimal MAS outperforms GS and BM. 




Fig. 3. Upper Bounds of ASD for RS (7,3) code over the BSC 

Remark 3 Over BSC's, the gain of ASD with infinite cost over GS decoding is very little for practical high rate RS 
codes. Besides, simply increasing the bit-level decoding radius may not necessarily lead to better performance at a 
moderate FER level. Since GS is a symbol-level decoding algorithm, it may be able to correct more typical error 
patterns at a moderate FER level than a bit-level decoding algorithm with a slightly larger bit-level decoding radius 
and hence leads to a better performance at that FER level than ASD with the proposed asymptotically optimal 
MAS. 

V. Bit-level Decoding Region of Algebraic Soft-decision Decoding Algorithm 

In this section, we generalize the analysis in previous two sections to a bit-level error and erasure channel. 
A simple MAS is proposed and its properties are studied. The decoding region of the proposed MAS in terms 
of the number of errors e and the number of bit-level erasures / is investigated for both infinite and finite cost 
cases. Finally, we show that the decoding region of the proposed MAS monotonically enlarges as the multiplicity 
parameter M increases. 

A. The Proposed MAS for a Mixed Error and Erasure Channel 

We first propose a MAS for the mixed channel, which is motivated by the analysis of the previous two sections. 
In Section HI, a simple proposed MAS has been shown to have nearly the same performance as optimal PMAS for 
high rate RS codes over the BEC. On the other hand, as shown in Section IV, ASD even with an optimal MAS has 
hardly any gain over GS decoder for high rate RS codes over BSC's. Altogether, these results suggest that most 
of the gain of ASD for high rate RS codes is from taking care of 1-bit erased symbols. Therefore, we expect to 
obtain most of the gain of ASD over other channels comes from assigning nonzero multiplicity only to symbols 
having zero or one bit erasure. We have the following proposed MAS: 
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Proposed Multiplicity Assignment Strategy: In each received symbol, we assign mo = M if that symbol does not 
contain erased bits, assign mi = M/2 if the symbol contains 1-bit erasure and assign multiplicity zero for symbols 
containing more than 1-bit erasures, that is to set ruj = 0, j = 2, • • • , m. 

Under the proposed MAS, there are 5 types of symbols, which are listed below with their corresponding score 
per symbol Sb and cost per symbol Cb'- 

(T-1) correctly received symbol: Sb = M and Cb = ^^''+^ 

(T-2) symbol got erased at symbol-level: Sb = and Cb = 

(T-3) erroneous symbol without erasure: Sb = and Cb = ^ 2^ 

(T-4) 1-bit erased symbol without other errors: Sb = f and Cb = M!±2m 

(T-5) 1-bit erased symbol with other errors: Sb = and Cb = ^ 

As before, we first characterize the worst case error pattern for the proposed MAS, which dominates the 
performance for high rate RS codes. We first have the following lemma: 

Lemma 7 Under the proposed MAS, an ASD error occurs over the mixed channel if S < M{K — 1). 

Proof: When S < M{K - 1), T{S) < T{M{K - 1)). Since T{M{K - 1)) = {M + 1)M{K - l)/2 and 
T{S) is a convex function, it is easy to verify the following upper bound on T{S): 

T{S)<^{M + 1)S, 0< S < M{K -1) (32) 

Considering all types of symbols, we have ^{M + 1)Sb < Cb- Therefore, for any received word, we have the 
following: 

T{S)<^{M + 1)S<C (33) 

■ 

Next, we show that recovering bit-level erasures in error-free symbols improves the performance monotonically. 

Lemma 8 Over the mixed channel, suppose a received vector y is certainly decodable by ASD under the proposed 
MAS with multiplicity parameter M. Any other received vector y' with only a subset of symbols, which are in error 
or erased in y, in error or erased, is certainly decodable under the same condition. 

Proof: Recovering 1 bit-level erasure from an error-free symbol can be of the following 3 cases: 

1) From a symbol with more than 2-bit erasures: A^* = and AC = 

2) From a symbol with exactly 2-bit erasures: AS = ^ and AC = M!±2M 

3) From a symbol with exactly 1-bit erasure: AS = ^ and AC = ^ 

Obviously, Case 1) is certainly decodable. For Case 2) and Case 3), let S, C and S", C be scores and costs 
before and after erasure recovering. We have a{K — 1) < 5* < (a + 1){K — 1), where a is an integer. Since 
T{S) > C and according to Lemma 7, we must have a + 1 > M. Since T{S) is a piecewise linear function with 
monotonically increasing slope, we have: 



T{S') = t(^S + ^ 



M 

> T{S) + {a+l)- 

> ^(/) + ^ 

> Cif) + ^^>Cif) (34) 



where (34) is due to the fact that M > 2 in the proposed MAS. 
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The following lemma shows that spreading bit-level erasures in different error-free symbols results in a worse 
performance than putting them in the same symbol. 

Lemma 9 Over the mixed channel, suppose a received vector y has a symbol containing more than 1 bit-level 

erasures and we move 1 bit-level erasure from this symbol to a correctly received symbol to get another received 
vector y' . If the resulting vector y' is certainly decodable using the proposed MAS with multiplicity parameter M, 
then the original received vector y is also certainly decodable under the same condition. 

Proof: Moving 1 bit-level erasure from a symbol with more than 1-bit erasure to a correctly received symbol 
can be of the following 3 cases: 

1) From a symbol with more than 2 erasures, A5 = and AC = — ^ 

2) From a symbol with exactly 2 erasures and no errors, AS = and AC = 

3) From a symbol with 2 erasures and some errors, AS' = — -y^ and AC = ^- 

Case 1) is nothing but adding 1-bit erasure in a correct symbol. As shown in Lemma 8, it results in no better 
performance. In Case 2) and Case 3), moving 1-bit erasure to correct symbols leads to no larger scores but a larger 
costs, therefore, it will also result in no better performance. ■ 

With the above lemmas, we now characterize the worst case error and erasure pattern. 

Theorem 5 Over the mixed channel, the worst case error and erasure pattern for the proposed MAS is that all 
bit-level errors are spread in different erasure-free symbols and bit-level erasures are spread evenly in the remaining 
symbols. Besides, if the worst case pattern received vector y with e errors and f erasures is certainly decodable 
under the proposed MAS with multiplicity parameter M, any received word y' with e' < e bit-level errors and 
f < f bit-level erasures is certainly decodable under the same condition. 

Proof: In the worst case, errors should obviously be spread in different symbols. Besides, having erasures in 
erroneous symbols will lead to the same score, but a smaller cost. Hence, in the worst case, errors and erasures 
should also be spread in different symbols. If the number of errors e and the number of bit-level erasures / satisfy 
e -\- f < N, according to Lemma 9, putting erasures in a correctly received symbol is the worst case. Applying 
Lemma 9 recursively, in the worst case, bit-level erasures are spread in different symbols. If e-\- f > N, putting 
more than 2 bit-level erasures in the same symbol essentially reduces the number of bit-level erasures in error-free 
symbols and according to Lemma 8, it always leads to no worse performance. As a result, when e + / > iV, in 
the worst case, we must have errors, 1-bit erased symbols and 2-bit erased symbols occupying all N symbols of 
the received word. 

On the other hand, fewer errors will lead to better performance in the worst case. Erasures will only participate 
in error-free symbols in the worst case. According to Lemma 8, fewer bit-level erasures in error-free symbols leads 
to no worse performance. In conclusion, for any received word, the worst case for the proposed MAS is that all 
errors are spread in different erasure-free symbols and erasures are spread evenly in the remaining symbols. Besides, 
reducing the number of bit-level errors e or the number of bit-level erasures / will not degrade the worst case 
performance. ■ 

Theorem 5 characterizes the worst case error and erasure pattern, which makes the decoding region analysis 
easier. 

Corollary 2 Over the mixed channel, the score and the cost of the proposed MAS with multiplicity parameter M 
in the worst case can be expressed in terms of the number of errors e and the number of bit-level erasures f as 
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follows: 

S={N-e-f/2)M (35) 
M 

C<{2N-f)—+N— (36) 

Proof: The corollary is immediate by considering the worst case error and erasure pattern in Theorem 5 in 
both e + /<iVande + />A'' cases. ■ 

Corollary 3 Over the mixed channel, an ASD error occurs under the proposed MAS if f > 2(iV — {K — 1) — e) 

Proof: The corollary is obtained by combining Lemma 7 and Corollary 2. ■ 
Corollary 3 suggests that an ASD error occurs under the proposed MAS before all error-free symbols get erased 
at the symbol-level. Besides, it also gives an outer bound on the decoding region of the proposed MAS. The exact 
decoding region of the proposed MAS will be studied in more detail in the following subsection. 



B. Infinite Cost Performance Analysis 

Due to the simplicity, the decoding region of this proposed MAS for medium to high rate RS codes can be 
characterized analytically. First, we consider the infinite cost case. 

Theorem 6 Under the proposed MAS with M ^ oo, the decoding region over the mixed channel in terms of e 
and f when e + f < N is: 

e<N-f/2- ^{K-l){N-fl2) (37) 
Proof: When e -|- / < A?^, in the worst case the score and the cost can be expressed as 

S = {N-e-f/2)M (38) 
C = l/4M^{l + o{l)){2N - f) (39) 

Plugging in (11), we can get: 

e<N-f/2- ^/{K-l){N-f/2) (40) 

■ 

According to Corollary 2, when e + f > N, (40) is still achievable and the actual decoding region can be larger. 
When / = 0, the above region becomes the maximum error correcting radius of GS decoding; when e = 0, we 
can obtain the worst case bit-level decoding radius derived in (23). 

To get an idea on how good this proposed MAS is, we derive an outer bound on the optimal decoding region of 
ASD with infinite cost. Using a technique similar to that used in Section IV, we first derive an optimal MAS over 
a 1-bit flipped or erased channel. That is, we assume in each symbol of the RS codeword, there is at most either 
1 bit in error or at most 1-bit erasure. The derivation of the optimal decoding region for this channel is given in 
Appendix 11. In general, the 1-bit flipped or erased channel is optimistic compared with the actual bit- level error 
and erasure channel. Hence, when e -|- / < A?^, the optimal decoding region of a 1-bit flipped or erased channel 
serves as an outer bound of the actual decoding region of a mixed error and bit-level erasure channel. 
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C. Finite Cost Performance Analysis 

Consider, the proposed MAS with finite cost, in the simplest case, M = 2. That is, we assign mo = 2 to 
symbols without erasures; if there is 1 bit-level erasure, we assign mi = 1 to each candidate; otherwise, we assign 
mj = 0, i = 2, 3, • • • , m. The decoding region is characterized in the following theorem. 

Theorem 7 Under the proposed MAS with M = 2, the decoding region of RS codes of rate R > 2/3 + 1/N over 
the mixed channel is: 

e<l{N-K+\)-L (41) 

Proof: For R > 2/3+1/-/V, in the worst case, errors and erasures will not overlap. Hence, S = 2{N — e—f/2) 
and C = 3N — /. We must have: 

(a + 1)(2(7V - e - f/2) - a/2{K - 1)) > 3Ar - / (42) 
a{K - 1) < 2{N - e - f/2.) < (a + 1){K - 1), where a is a non-negative integer (43) 

For a = 0, 1, we get contradictions. 

For a > 3, we get trivial bounds. 

For a = 2, we obtain the decoding region: 

e<^{N-K + l)- 1/, for {K - l)/N > 2/3 (44) 



Corollary 4 For RS codes of rate R < 2/3 + 1/N, the decoding region over the mixed channel in Theorem 7 is 
achievable under the proposed MAS. 

Proof: Since T{S) has a monotonically increasing slope, when (41) is satisfied, we must have T{S) > C if 
e + f<N.'lfe + f>N, again due to Corollary 2, the above region is still achievable. ■ 
We can also derive a decoding region of the proposed MAS with any given multiplicity parameter M as follows: 

Theorem 8 Under the proposed MAS with a given multiplicity parameter M, the decoding region in terms of the 
number of errors e and the number of bit-level erasures f (where e + f < N) can be expressed as: 

/ _ a(a + l)(^-lV2 + C 
^ 2 M(a+1) 



nth the cost C ^ \{N - f)M{M + 1) + /^(^ + 1) and a = [ 



The derivation of Theorem 8 is provided in Appendix III. Similarly when e + f > N, the region in Theorem 8 
is still achievable. Though the actual decoding region can be even larger for low rate RS codes, we are not going 
study it in more detail in this paper. 

We give some examples of the decoding region of RS codes over the mixed channel under different decoding 
schemes. In Figure 4, a high rate RS(255, 239) code is considered. The decoding region of BM and GS decoding 
are exactly the same. However, when considering erasures at the bit-level, the decoding region of the proposed 
MAS is significantly larger than BM and GS decoding. For the proposed MAS with M = 2, the simulated bit-level 
region (obtained by numerically checking the sufficient condition for each error and erasure pair), the high rate 
bit-level achievable region as in Theorem 7 and the general achievable region as in Theorem 8 are right on top of 
each other. The maximum number of bit-level erasures the proposed MAS with M = 2 can correct is about 1 .5 
times that of BM and GS decoding at the bit-level and the decoding region of the proposed MAS with M = 2 
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is significantly larger than BM and GS decoding region. In the infinite cost case, the proposed MAS achieves the 
outer bound of the decoding region. The decoding region corresponding to M = oo encompasses the decoding 
region corresponding to M = 2. 



RS(255, 239) decoding region comparison 




Fig. 4. Bit-level Decoding Region of Algebraic Soft Decoding for RS(255, 239) 



In Figure 5, we show the decoding region of a low rate code RS(63, 23). In this case, the high rate achievable 
region in Theorem 7 becomes loose. On the other hand, the general decoding region derived in Theorem 8 still 
coincides with the actual decoding region (by checking the sufficient condition for each error and erasure pair). 
When there is no erasure, the maximum number of errors the proposed MAS can correct is the same as GS 
decoding. Again, since ASD can take advantage of the erasure information at the bit-level, the decoding region of 
the proposed MAS is strictly larger than the decoding region of GS with symbol-level error and erasure decoding. 
When e + f > N in the infinite cost case, the outer bound becomes invalid. However, the achievable region in the 
infinite cost case is still a valid achievable region. 

D. Monotonicity 

In this subsection, we show the monotonicity of the decoding region of the proposed MAS as a function of 
multiplicity parameter M over the mixed channel. It was shown by McEliece in [22], the error correction radius of GS 
algorithm is a monotonic function of multiphcity parameter M. This monotonicity does not hold for ASD algorithms 
in general. However, the monotonicity result is of interest since it justifies that the asymptotical performance analysis 
by letting M — > oo is indeed the "best" achievable result and it also verifies that increasing the cost will lead to at 
least no worse performance. 

We need the following property of the function T{S): 

Lemma 10 T((a -|- l)x) > ^^T{x), if x> K —1 and a is a positive integer. 

Proof: This proof of this lemma is similar to Theorem A-1, (A-9) in [22]. Since x > K 
(l + !-^{K- 1) <x<(l + {K-1) for / = 0, 1, 2, • • • , a - 1 

(a + 1){K - 1) <ax <{a + l + 1){K - 1) 



— 1, we have: 

(46) 
(47) 
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RS{63, 23) decoding region comparison 




-A — BM region 

—I — GS region, IVI = 2 



simulated bit-level region, M = 2 
high rate achievable region, M = 2 
-e — general achievable region, M = 2 
— * — bit-level achievable region, M = 



Fig. 5. Bit-level Decoding Region of Algebraic Soft Decoding for RS(63, 23) 



T{ax) can be computed as: 



T{ax) = {a + l + l) 



'ax-^-^{K-l) 



On the other hand, (a -I- \)x is in the following range as: 



a^ + {l + 2)a + 



a a 
{a + l + l){K-l) <(a + l)a;< {a + l + i){K-l) 



{K-1) 



(48) 



(49) 
(50) 



Since T{S) is a piecewise Unear function with monotonically increasing slope, T{S) > {i + l){S — ^{K — 1)) for 
any non-negative integer i. Hence, we have the following lower bound onT{{a + l)x): 



T{{a+l)x) >{a + l + 2) 
Combining (48) and (51), we have the following: 



, , , a + I + 1 ^ , 
ia+l)x (K-l) 



(51) 



T{{a+l)x)- 



-T{ax) > (a -I- Z -I- 2) 



(a + l)ar-^-tl±l(i<:-l) 



a + 2 



a + l + 1) 



ax - 



2 

a + 1 



{K-l) 



> 



-lx+i^^±l±^iK-l) 



= -[{K -l){a + l + l)-ax]>0 
where the final step in (54) follows by the fact that I > and ax < {K — l)(a -|- Z -|- 1). 



(52) 

(53) 
(54) 



Theorem 9 Over the mixed channel, if a received word is certainly decodable using ASD with multiplicity parameter 
M, it is certainly decodable under multiplicity parameter M + 2 (M has to be even in the proposed MAS), which 
means the performance of ASD under the proposed MAS is monotonic with the multiplicity parameter M. 
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Proof: If a codeword is certainly decodable with the multiplicity parameter M, we have T{S{M)) > C(M), 
where S{M) and C{M) are score and cost with multiplicity M respectively. Considering all types of symbols in 
the received word, we have the following relationship: 

S{M + 2) = ^±^S{M) (55) 

If a received word is certainly decodable, according to Lemma 7, we have S{M) > M{K — 1). Therefore: 

T{S{M + 2)) = r((M + 2)^) 

> C(M + 2) (58) 

where (57) is obtained by applying Lemma 10 twice and (58) is due to (56). ■ 
Note that the monotonicity property holds for aU RS codes regardless of the rate. 

VI. Bit-Level Generalized Minimum Distance Decoding Algorithm 

In this section, we develop a practical SDD algorithm for RS codes, which is motivated by the analytical results 
in the previous sections. 

A. The Generic BGMD Algorithm 

As shown in Section V-C, the proposed MAS has a significantly larger decoding region than conventional BM 
and GS decoding over a mixed error and bit-level erasure channel. This provides the intuition that properly treating 
erasures at the bit-level will also help in RS soft-decision decoding over other channels. An efficient way to utilize 
erasures over many channels is by ordering the reliability values of the received bits, treating the LRB's as erasures 
and running an error and erasure decoder successively, namely generalized minimum distance (GMD) decoding [2J . 
In each iteration, the decoder can decode erasures in the LRB's together with some extra errors in the remaining 
most reliable bits (MRB's) as long as the error and erasure (e, /) pair is within the decoding region of BM algorithm. 
Due to the similarity between the proposed algorithm and conventional symbol-level GMD for RS codes, it is called 
bit-level GMD (BGMD). 

The generic algorithm of BGMD is described in Algorithm 1 . 

Remark 4 In terms of implementation, BGMD does not need to run ASD algorithm many times. In fact, the 
interpolation part can be shared between different erasure patterns. Similar to the techniques proposed in [18]- 
[21], we can generate all the candidate codewords in one interpolation round by applying factorization in the 
intermediate steps during the interpolation procedure. Besides, factorization needs to be performed only at outer 
corner (e, f) points. For high rate RS codes, the number of "test erasure patterns" of BGMD is the same as 
conventional symbol-level GMD. 



B. Performance Analysis of BGMD 

Due to the simple structure of BGMD, the performance of BGMD for practical high rate RS codes over an 
AWGN channel can be tightly bounded using order statistics techniques. Define D(M) as the decoding region of 
the proposed MAS over a mixed bit-level error and erasure channel, namely the set of error and erasure (e, /) 
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Algorithm 1 Bit-level Generalized Minimum Distance Decoding Based on Algebraic Soft Decision Decoding for 

Reed-Solomon Codes 

Stepl. Initialization: set the initial iteration round i = 1 and generate the log likehhood ratio (LLR) for each 

Pic ■ — Ol y ■ ) 

coded bit based on the channel observation yj: Lj = log -pj^r^^^^ i = li 2, • • • , n 
Step2. Reliability Ordering: order the coded bits according to the absolute value of the LLR's { \Lj \ } in ascending 
order and record the ordering indices {sj}. 



Step4. Multiplicity Assignment: 

In each symbol of the received word c, assign multiplicities according to: 

1) if no bit is erased, assign M to the received symbol; 

2) if there is I-bit erasure, assign M/2 to each candidate; 

3) if there is more than 1-bit erasure, assign multiplicity zero. 

StepS. Algebraic Soft Decision Decoding: Run ASD according to the multiplicity assignment determined in 

Step4.. Keep the generated codewords in the decoding list. 
Step6. Erase the Least ReUable i Bits: Cg, = e for Z = 1, • • • ,i, (where e indicates a bit-level erasure). 
Step?. Iteration: If i < n — A; and ASD is still able to correct the current erasures given no error, set i ■*— i -|- 1 

and go to Step4. for another decoding iteration. 
StepS. Final Decision: Output the most likely codeword in the decoding list. If there is no codeword in the hst, 

a decoding failure is declared. 



pairs that is certainly decodable by the proposed MAS with multiphcity parameter M as specified in Theorem 8. 
Let fmax,M and e„iax,M be the maximum number of errors and erasures respectively such that (0, fmax,M) 
{'£-max,M, 0) are still in D{M). The FER of BGMD can be upper bounded by the FER performance of using a set 
of bit-level error and erasure decoders, each with different number of erased bits / in the LRB's and a different 
error correction capability e such that (e, /) e D{M). Note, however, D{M) is the worst case decoding region 
of the proposed MAS, BGMD can in fact correct even more number of errors and erasures if some of the errors 
and erasures overlap in some symbols. However, for high rate RS codes, this upper bound becomes tight, since the 
worst case error and erasure pattern dominates. Performance analysis of BGMD then boils down to bounding the 
performance of a conventional GMD decoder for binary codes [34] with a skewed decoding region D{M). Hence, 
upper bounds of GMD for binary codes, such as the one derived in [34], are directly applicable to evaluating 
the performance of BGMD decoding. For readers' convenience, we give the detailed procedure to compute the 
FER upper bound on BGMD algorithm in Appendix IV. For more comprehensive studies on this bound, we refer 
interested readers to [34] and [35] for appUcations to other order statistics based decoding algorithms. 

Thanks to this upper bound, the performance of BGMD in high SNR's, where RS codes operate in many 
practical systems, can be predicted analytically, which is beyond the capability of computer simulation. As an 
example, performance bound of BGMD over a popular high rate RS(255, 239) is plotted in Figure 6. At an FER = 
10~^^, the upper bound of BGMD with M = 2 has a 0.8dB and 0.3dB gain over conventional BM and GMD upper 
bound respectively. With asymptotically large cost M = oo, the gain of BGMD upper bound over BM increases to 
IdB at an FER = 10~^^. On the other hand, the performance of KV algorithm cannot be simulated at such a low 
FER. Compared with another popular SDD algorithm, i.e., the box and match algorithm (BMA) order- 1 with 22 
bits in the control band [8], the upper bound of BGMD with M = 2 has a 0.2dB gain at this FER level with a much 
smaller complexity and memory consumption than BMA. In high SNR's, the upper bound of BGMD with M = 2 
also has comparable performance to the upper bound of Chase Type-2 Decoding [3] with 16 test error patterns. 
The performance gap of BGMD to a genie decoder with decoding radius t = N — K becomes smaller and smaller 
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as SNR increases. Note that the actual performance of BGMD may be even better than that predicted by the upper 
bound as will be shown in the simulation results in the following section. The FER upper bound on BGMD can 
be further tightened by considering the joint order statistics, which also increases computational complexity. 




E,/N„(dB) 

Fig. 6. Performance Bounds of Algebraic Soft-decision Decoding for RS Code (255, 239) over an AWGN Channel 

The generic BGMD algorithm can also be extended to incorporate Chase type decoding [3], [20], [21]. Under the 
proposed MAS, the corresponding performance can also be tightly upper bounded by sinoilar bounding techniques 
using order statistics as shown in [35]. 

C. Discussions 

We first discuss a counter-intuitive phenomenon of KV decoding, which was first observed in [36]. That is, KV 
decoder may fail even when the received vector does not contain any errors. We give an example as follows: 

Example 1: Consider an RS(255, 239). Suppose in the received vector, no bit is in error. 255 x 7 = 1785 bits 
are perfectly received, i.e., the magnitude of their LLRs are all infinity. In each symbol, there is one bit that is 
corrupted by some noise, but still they have the correct signs. The APP that it is the transmitted bit is 0.7 and the 
probability that it is the wrong bit is 0.3. According to the KV algorithm, i.e., PMAS, we will have S = 178.5M 
and C Ri 73.95M^. It is easy to verify that even though there is no bit in error, the sufficient condition (11) will 
be violated. It can also be verified by actual simulation that KV will fail in some cases even when no bit is in 
error. In fact, this phenomenon was recently reassured in [37]. The analysis in [37] showed that under PMAS, the 
asymptotical decoding radius of ASD might be 0, which suggests the decoder can fail even though there is no error. 

At first glance, this phenomenon seems counter-intuitive. It seems to suggest that soft information even degrades 
the performance. However, from the analysis in previous sections, we can get an intuitive and sensible interpretation. 
ASD in some sense treats weighted erasures, therefore, similar to erasure decoding over AWGN channels, in some 
cases, we may end up erasing too many correct bits and cause a decoding failure even though there is no error. 
On the other hand, since BGMD treats erasures according to the received reUability value and also erases bits 
successively, these abnormal cases will be excluded. 

Besides, in general, the monotonicity of ASD is not guaranteed. For instance, it is observed in [16] that for the 
simplified KV algorithm, the decoding performance does not monotonically improve as the cost increases. For the 
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proposed BGMD, on the other hand, as shown in the previous section, the decoding region will monotonically 
become larger as a function of the multiplicity parameter M. 

The generic BGMD can naturally be generalized to take more than 1-bit erasures into account, which will be 
important in decoding medium to low rate RS codes. The associated performance bounds are also of great research 
interest, since for medium to low rate RS codes, the upper bound considering the worst case bit-level decoding 
region alone becomes loose. 

VII. Simulation Results 

In this section, we show simulation results of the proposed BGMD over various communication channels. We 
will see that the proposed BGMD, though derived from a simple MAS, is superior to many existing MAS's which 
are far more complicated. Besides, in contrast to most MAS's in the literature, the order statistics based upper 
bound can accurately evaluate the actual performance of BGMD for many practical high rate RS codes. We assume 
that binary phase shift keying (BPSK) is used as the modulation format in all the simulations in this paper, since 
in this case bit-level soft information can be generated straightforwardly. The proposed BGMD algorithm can also 
be applied to higher order modulations to provide extra gain over symbol-level soft decision decoding. However, 
this is beyond the scope of this paper and will be discussed in more detail elsewhere. 

In Figure 7, we plot the FER performance of an RS(31, 25) over an AWGN channel. BGMD (M = 2) outperforms 
conventional BM by 1.3dB at an FER = 10~^. It also outperforms conventional symbol-level GMD by 0.6dB at an 
FER = 10~^ and is slightly inferior to Combined Chase and GMD CGA(3), which has a much larger complexity, 
by 0.2dB. Compared with existing MAS's for ASD, it gives favorable performance as well. With M = 2, it even 
outperforms KV algorithm with M = oo by 0.5dB at an FER = 10~^. With M = oo, the performance of BGMD 
outperforms the performance of Gaussian approximation based MAS [23] and the performance of Chernoff technique 
based MAS [24], [38], which are far more compUcated than the proposed BGMD in multiphcity assignment. 




Fig. 7. Algebraic Soft-decision Decoding of RS Code (31, 25) over an AWGN Channel 



In Fig. 8 we evaluate the FER performance of a long code, RS(255,239) code. Again, BGMD (M = 2) 
outperforms GMD and is comparable to CGA(3). As the codeword length increases, KV algorithm becomes 
asymptotically optimal as shown in [15]. The performance of the proposed BGMD is still comparable to KV 
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decoding. In the infinite cost case, tiie performance of BGMD (M = oo) is slightly better than the performance of 
KV (M oo); in the finite cost case, BGMD (M = 2) even outperforms KV (M = 4.99). Besides, since BGMD 
only assigns multipUcities to symbols with at most 1-bit erasure, the memory consumption in storing the assigned 
multiplicities is much smaller than KV. The upper bound is quite tight and it starts to outperform KV (M = 4.99) 
and is only O.ldB inferior to the actual performance at an FER = 10~^. As shown in Figure 6, it gives an estimate 
of the performance of BGMD in high SNR's as well. 



Algebraic Soft Decoding of RS Code (255, 239) over AWGN Channel 




E,/N„(dB) 

Fig. 8. Algebraic Soft-decision Decoding of RS Code (255, 239) over an AWGN Cliannel 

Though the upper bound of BGMD is tight only for medium to high rate RS codes, the proposed BGMD algorithm 
actually provides even more significant coding gain for low rate RS codes. As shown in Figure 9, the performance 
of BGMD (M = 2) can outperform BM, GMD, CGA(3) decoding by a large margin for an RS(63, 12) code over 
an AWGN channel. The gain of BGMD over BM is about 2dB at an FER = IQ-*. In this case, CGA(3) is far 
more inferior to BGMD. BGMD (M = 2) has almost identical performance as KV (M = 4.99). While, in the 
infinite cost case, KV does have a 0.6dB gain over BGMD at an FER = 10~^, which suggests that taking care of 
more than 1 -bit-erased symbols might provide extra gains for low rate RS codes. It is an interesting open problem 
to develop such kind of MAS. 

The gain of the proposed BGMD over BM and CGA becomes larger when the channel is "similar" to a BEC, 
say Rayleigh fast fading channels, since BGMD can correct a significantly larger number of bit-level erasures than 
conventional BM as discussed in Section III. As shown in Figure 10, the gain of BGMD (M = 2) is about 1.5dB 
compared with BM at an FER = 10""^. As expected, the gain of BGMD over CGA(3) is more significant over 
the fading channel. Compared with KV (M = oo), BGMD (M = 2) is slightly inferior to KV (M = oo) in low 
SNR's, but it intersects KV (M = oo) at an FER = 10"'' and performs better in high SNR's. BGMD (M = oo) 
has a 0.75dB gain over KV (M = oo) at an FER = 10~^. The superior performance of BGMD seems to suggest 
that for high rate RS codes, efficiently taking advantage of bit-level erasures exploits most of the gain in ASD. 

Performance of the proposed BGMD is also investigated over practical magnetic recording channels, that is, 
longitudinal channel and perpendicular channel with 90% jitter noise. More details of the channel model can be 
found in [39]. Similar performance gains of BGMD have also been observed over practical recording channels. 
BGMD (M = 2) outperforms conventional GMD and performs competitively with KV and CGA(3), which are 
much more complex. This superior performance of BGMD suggests that though RS codes are usually considered 
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Algebraic Soft Decoding of RS Code (63, 12) over AWGN Ctiannel 
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Fig. 9. Algebraic Soft Decoding of RS Code (63, 12) over an AWGN Channel 

as a powerful burst error correction code, it is still beneficial to taking advantage of soft information at the bit-level 
even over practical magnetic recording channel models, where errors are usually bursty. 

Vlll. Conclusion 

We have presented multiplicity assignment strategies and performance analyses of algebraic soft-decision decoding 
over erasure channels, binary symmetric channels and mixed error and bit-level erasure channels. Performance 
analysis motivates a simple sequential multiplicity assignment scheme, bit-level generalized minimum distance 
decoding. The proposed BGMD outperforms most of the MAS's in the literature for RS codes in a wide range of 
rates over various channels both in terms of performance and complexity. Due to its simplicity, the performance of 
BGMD can also be tightly bounded using order statistics based upper bounds even in high SNR's over an AWGN 
channel. The proposed BGMD has potential apphcations in decoding RS codes in practical recording systems and 
RS outer codes in concatenated systems. 
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Appendix I 

Derivation of the Bit-level Radius of a 1-bit Flipped BSC 

Suppose there are e < N 1-bit flipped symbols. In the MAS, we assign M to the received vector and tM to the 
1-bit flipped neighbors. As M ^ oo, the score and cost are: 

S = {N - e)M + eMt (59) 
C=^[M^{l+mt^){l + o{l))] (60) 
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Algebraic Soft Decoding of RS Code (255, 175) over Rayleigh Fast Fading Channel 



10 




10 



-0— GMD 
-e— CGA(3) 

-* — BGMD, M = 2 
^ BGMD, M=~ 



11 



12 



13 



14 



15 



E,/N„(dB) 



Fig. 10. Algebraic Soft Decoding of RS Code (255, 175) over the Rayleigh Fast Fading Chaimel 



Plugging the score and cost into (11), we get: 



[{N - e) + et]M > ^/{K - 1)N{1 + mf^)]^ 



e < 



(61) 

(62) 



For RS codes of rate R < + -j^, we have {K — 1)(1 + m) < A^. Setting t = 1 in (61), the inequality 
becomes independent of e and is always satisfied. In this case, the transmitted codeword will always be on the list. 
For higher rate RS codes, t is optimized to maximize the right hand side (RHS) of (62). This problem is equivalent 



to maximizing the slope between a given point {1,N) and a point on the hyperbola 



N{K-1) 



mx^ = 1, within 



the range < a; < 1 and y > \/N{K — 1), which is nothing but the tangent to the hyperbola. For the tangential 
point (a;o, j/o). we have the following relationships: 



N{K - 1) 
dx 



mxQ = 1 



N{K-l)m— 

yo 



^ N-yo 
I- xo 

From the above three equations, we can get: 

yo = {K~l){mxo + l) 

Plugging back to (63), we get 

m [m{K -1)-N]xl + 2m{K - 1)xq - [N - {K - 1)] = 



(63) 
(64) 
(65) 

(66) 

(67) 



Since we are only interested in xq € [0, 1], it is easy to verify that in all cases, the solution of (67) will be of the 
following form: 

-m{K -l) + y/A 



Xo = 



m{m{K - 1) - TV) 



(68) 
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ASD Of RS Code (255, 239) over Longitudinal Cliannel, = 2.0, 90% Jitter 
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Fig. 11. Algebraic Soft-decision Decoding of RS Code (255, 239) over Longitudinal Channel, Ds = 2.0, 90% Jitter 



where A = {m{K - 1))^ + {N -K + l){m'^{K - 1) - mN). Note that the singular point m{K - 1) - TV = can 
be removed by taking the Umit: [m{K — 1) — N] ^ 0. Combing (64) and (66), the optimal error correction radius 



is: 



< 



N 



+ 1 



(69) 



where xq is computed in (68). The maximum emax satisfying (69) is the error correction radius of ASD algorithm 
under the proposed asymptotically optimal MAS over 1-bit flipped BSC and t = xo is the optimal multipUcity 
coefficient. 

Moreover, can be further bounded as follows: 



VA<miK- 1)[1 + -i 



iN) 



(70) 



For high rate RS codes, {N - K + iXm^iK - 1) - mN) < {m{K - the left hand side (LHS) and RHS of 
(70) becomes very close and the upper bound on \/A becomes tight. Plug (70) into (68): 

N-{K-1) 



Xo = 



2m{K-l) 



Plug (71) into (69), we finally get: 

N{N -K + l) 
N+{K-l) 



N - ^JN{K -I) 



1 + 



^N{K-l)-{K-l) 
N+{K-1) 



(71) 



(72) 



Note that Cmax > ^max and it serves as an upper bound on the true decoding radius. However, (72) suggests that 
in the 1-bit flipped BSC case, the improvement of ASD over GS algorithm is very Uttle for high rate RS codes. A 
similar result was independently obtained in [28]. 
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ASD Of RS Code (255, 239) over Perpendicular Channel, = 2.0, 90% Jitter 
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Fig. 12. Algebraic Soft-decision Decoding of RS Code (255, 239) over Perpendicular Channel, Ds = 2.0, 90% Jitter 



Appendix II 

Derivation of the Decoding Region of ASD over a 1-bit Flipped or Erased Channel 

We consider the following MAS for the 1-bit flipped or erased channel: if the symbol does not contain erased bits, 
assign multipUcity M to the received symbol and Mti to all 1-bit flipped neighbors. In the 1-bit erased symbols, 
we assign Mt2 to both candidates. 

Suppose we have / erasures and e errors. For a given /, an optimal MAS tries to maximize e. 

In the infinite cost case, the score and the cost are: 



S 

c 



(N- 



f)M + eMti + fMt2 

2 ' mMHl) + 2fMHl] (1 



-0(1)) 



When a received vector is certainly decodable in the infinite cost case, (11) has to be satisfied. We have: 



e < 



N-f{l- t2) - ^{K - 1) [{N - /)(1 + mt\) + 2/i2] 



When / 



0, (75) reduces to (62). Here, we only consider the non-trivial case, / > 0. Define 



Ji 
J 



= TV - /(I - t2) - x/ (if - 1) [(iV - /)(1 + mt\) + 
Ji 



1-ti 

We first maximize J\ with respect to t-2,. Take the derivative, we get: 

aJi ^ 4(ir - i)/t2 

at2 ^ 



9{t2) 



(73) 
(74) 

(75) 

(76) 
(77) 

(78) 



2^{K~l)[{N- f){l + mti) + 2ftf, 
Note that g{t2) is a monotonically decreasing function, with g(0) = / > 0. Note that limt^^oo .9(^2) > when 
/ > 2{K — 1). This suggests the proposed optimal MAS will have t2 — > 00. In this case, S k, fMt2, C « 
fMHlil + 0(1)) and S > ^y2{K - 1)C will always be satisfied if f > 2{K - 1). Therefore, when f > 2{K - 1), 
e + f = N errors and erasures can be recovered for the 1-bit flipped or erased channel, which is optimal. It can 
also be shown that when f = 2{K — 1) < N, e + f = N is also achievable by properly assigning multiplicities 
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to symbols without erasure. This is not too surprising, since the 1-bit erased symbols are guaranteed to be error 
free and therefore, it worth putting more multiplicities on 1-bit erased symbols. For high rate RS codes, we have 
2{K — 1) > N. Hence, g{t2) will have a unique zero in t2 G [0, oo), which maximizes Ji. Set g{t2) — 0, we get: 



_ {N-f){l+mtl) 

- V 4(i^-l)-2/ ^^^^ 



Ji = - / - ^[(i^ - 1) - f/2] (N -/)(! + mtl) (80) 
J can thus be simplified as a function of ti only as: 



J = A.^-f^^ (81) 



where 



A = ^[(K-l)-f/2] (N^f) (82) 



B = ^{N-f)/[{K-l)-f/2\ (83) 



(81) has a similar structure to (62). When f > 2{K - 1) + let h = 1 and = y %K-i)-2f ' 

condition (11) will always be satisfied. 

For / < 2[K — 1) -|- ^^^^i^f-^. we apply the same technique used in (62) here, i.e., to maximize the slope 
between the point (1, B) and a point on the hyperbola t/^ — mx"^ = 1 will give the optimal multiplicity coefficient 
ti: 

—m + 

h,opt = 7 52^ ^°^> 

The optimal J as a function of / is: 

JoM = iN-f) 7^^^'°"* (85) 
Eventually, the optimal decoding region is: 

e < Joptif) (86) 

Any received word from the 1-bit flipped or erased channel with e-bit errors and /-bit erasures satisfying (86) is 
certainly decodable by ASD under the proposed optimal MAS with ti^opt and t2,opt as the optimal multiplicity 
coefficients respectively. 

Appendix III 
Proof of Theorem 8 

Proof: When e + f < N, the cost is 

which does not depend on the number of errors e. T{S), as defined in (10), is a piecewise linear function with 
monotonically increasing slope. Since T{S) is monotonic, we first determine the unique interval where T{S) 
intersects C, i.e., T{a{k - 1)) < C < T((a + 1){K - 1)). Plugging (10) in T{a{K - 1)) < C, we get an upper 
bound on a: 
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with C defined in (87). The integer solution of a is: 



The threshold of the score can then determined by 



« - L \ J (89) 



S* =T-^(C) = ^:^ + -{K-l) (90) 

where T~^(C) is the inverse function of T{S). 

The received word is certainly decodable by ASD if 6" > S*, where S = {N — e — f/2)M. Therefore, we have 
the final decoding region as follows: 

2 M{a+1) 

where C and d are defined in (87) and (89) respectively. ■ 

Appendix IV 

Computation of the Frame Error Rate Upper Bound of BGMD Decoding 

We give a detailed description of the procedure to compute an upper bound on the FER of BGMD decoder. This 
upper bound is an extension of the GMD bound [34] for binary linear block codes with a bounded distance decoder 
(BDD). Without loss of generahty, we assume that the all-zero codeword is transmitted. Assuming that BPSK is 
the modulation scheme and that a zero is mapped to a channel symbol +1, the received value for the ith bit is 
r, = 1 + Hi, where nt AfiO, No/2). 

Let f{x, No) = ^^-^ e be the probability density function (PDF) of a Gaussian random variable (RV) with 
mean zero and variance No/2. Then, the cumulative density function (CDF) of this Gaussian RV is given by: 

/•oo 

Q{x,No)= / f{t,No)dt (92) 
The probabiUty that one bit is in error can therefore be expressed as: 

Pb^Q{l,No). (93) 

Let and be the PDF's of \ri\ given that n < and > 0, respectively. It is shown in [34] that and 
are given by 

where u{x) is a step function. 

Therefore, the corresponding CDF's are: 

Assume there are i erroneous bits in the received vector. Order the received bits according to their reliability 
values in decreasing order. Let Pj{i) be the j*^ ordered reUabiUty value in i erroneous bits. That is > P2{i) > 
■■■> (3i{i). On the other hand, there are n — i correct bits. Define 7; (n — i) as the l*^ value after ordering. We have 
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7i(*) ^ 72(0 > • • • > In-iin — i)- The density of and 7;(n — i) can be derived using the order statistics as 
in [40]: 



(n — i)! 
l)!(n-i-/)! 

Hence, the probabihty that the event > jiin — i)} occurs can be evaluated by the following double integral: 



P i/Sjii) > 7i{n -i))= / Uin-i)ix) / //3,(i)(y) dy dx (100) 

Jo Jx 

The performance of BGMD decoding can be bounded as follows: 

PbGMD < PmL + Plist ~ PList (101) 

Pust can be computed using the first order approximation in [34]. The basic idea is that for a specified number 
of errors in the received vector, the actual FER of BGMD is upper bounded by the FER of an error and erasure 
decoder with a fixed but optimized number of erasures in the LRB's. Pust can be expressed as: 

fmax,M n 

Plist < V Plil-PbT-' min P((e + l)errors in the (iV - /) MRB's) + V P,^(l-P(,)"- 

(102) 

fmax.M n 

J2 Pl{l-PbT-\ min P(/3e+i(i)>7n-e-/(n-i))+ ^ A^l " A)"" 

»=emax,M + l »=/max,M + l 

(103) 

where D{M) is the set of all error and erasure pairs (e, /) that is within the decoding region of the proposed MAS 
for a specified multiplicity parameter Af, as characterized in Theorem 8. fmax.M and emax,M are the maximum 
number of erasures and errors such that fmax,M) and {emax,M,^) still belong to D{M). 
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